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FIG. 7A 



IGGTCTCTC7GG7TAGACCAGATTTCAGCCTGCGAGCTCTCTGGC7AGC7AGGCA.ACCCAC 

TGCTTAAGCCTCAATAAAGCTTGCCTTGAGTGCTTCAAIGTAGTGTGTGCCCGTCTGTTGT 

100 

G7GACTC7GGTAAC7AGAGATCCC7CAGACC CC777AGTCAGAG7GGAAAA7C7C7AGCA 



CCCCCCGAACACGGACCTGAAACCGAAAGTACAACCAGAGGAGCTCTCTCGACGCaG 
200 . 

AC7CGGC77GCTGAAGCGCGCACGCCAAGAGGC3A<?GS6CAGCGACTGGTSA<5T.ACGC7 

f-^GAG . 300 

» M e c G 1 y A I a A r s A I aiS e r V c 1 1 e S e 

.AAA! 7 7 T 7G A C 7 AG CG G AG G C 7 AG A AG GAG AG A G AIGGGTG CGAGAGCG7CAG 7A7 7AA 



GlyGlyLvsLeuAspLysTrpGluLysIleArgLeuArsProGlyGlyLysLysLvsTy: 
G C G G GG GAAAA77 AG A7 AAA7G GGAAAAAA7 7CGG7TACGGCCAGGAG GAAAG A A AA AA7 

400 

ArsLeuLysKisIleValTrDAUSerAr;GluLeuGluArgTyrAlaLeuAsr.?roGly 
ATAG AC 7AAAAC ATA7AG 7A7G G G C AAG C AG GG AG C 7AG AACGA7 A7G C AC 7T AA7C C 7G 

LeuLeuGluTbrSerGluGlyCysLysGlnlLelleGLyGULeuGlnProAlalUGln 
G C C 77 77 AG A A AC A7 C AG AA GGC7G7AAACAAA7AATAG GGCAGCTACAAC C AG C7AT T C 
500 .... 
ThrGlyThrGluGluLeuArgSerLeuTyrAsnThrValAlalhrLeuTyrCysValais 
AG A C AG G AAC AGAAG AA C 77AG A7C A77A7A7AA7ACAG7AG CAACC C 7C 7A 7 7G 7G 7 AC 

600 

LysGlylleAspValLysAspThrLysGluAlaLeuGluLysiletGluGLuGluGlaAsn 
A7 A AAGG AA7 AG A7G7AAAAG AC AC C AAG G A AG C 77 7 AG AAAAG ATGG AG G AAG AG C AAA 

LysSerLysLysLysAlaGltiGlnAlaAlaAlaAspTlirG lyAsnAs nS e rGlnValSer 
AC AAAAGTAAGAAAAAGGCAC AGC AAG C AG CAGC7GACACAGGAAACAACAGC C AGG7C A 

700 

GlnAsnTyrProIleValGlnAsnLeuG laGlyGlnMe t Va 1H i sG 1 n A 1 a I le SerPro 
GCC AAAA77ATCC7ATAGTGCAGAACC7AC AGGGGCAAA7GGTAC ATCAGGC C ATATC AC 

ArgThrLeuAsnAUTrp ValLy sValtleG LuGluLysAlaPheSerProGluVallle 
C7AGAACT77GAACGCA7GGG7AAAAG7AA7AGAAGAAAAGGCT7TCAGCCCAGAAG7AA 
800 .... 
ProHetPheSerAlaLeuSerGluGlyAlaThrProGlnAspLeuAsnThrMecLeuAsn 
7ACCCA7G777TCAGCA7TATC AG AAGGAG CCACC CCACAAG A777AAACACC A7G CTAA 

900 

ThrVa IGlyGlyHisGlnAUAlaMetGlnMetLeuLysGluThrlleAsnGluGluAla 
ACACAGTGGGGGGACATC AAG C AG C C A7G C AAA7GC7AAAAG AGACCATC AATG AAG AAG 

AlaGluTrpAspArgLeuEisProValHisAlaGlyProIleAlaProGlyGlnllecArg 
C7GCAG AATGGGATAG GTTACATCC AGTGCATGCAGGGCCTATTG CACCAGGCC AG ATG A 

1000 

GluProArgGlySerAspIleAlaGlyThrThrSerThrLeuGlnGluGlnlleAlaTrp 
G AG AACC AAGGGGAAGTGAT AT AGC AGGAACTACTAGTACCCTTC AGGAAC AAATAGC AT 

• ••••• 

MetThrSerAsaProPtoIlePtoValGlyGLuIleTyrLysArsTtpIlelleValGly 
GGATGACAAGTAAC CCACCTATCCC AGTAGGAG AAATCTATAAAAGATGGATAATTGTGG 
1100 . 

LeuAsnLysIleValArgMe tTyr SerPro ValSer I I eLeuAs p II e Ar gG InG lyPro 
GATTAAATAAAATAGTAAG AATG TATAG C C CTGTCAG CATTTTGGACATAAG AC AG GGAC 

1200 



FIG. 7B 



LysGluProPheArgAspTyrValAspArgPheTyrLysThrLeuArsAlaGluGlnAla 
C AAAGGAACCTTTTAGAGAC TATGTAG AC C G CTTCTAT AAAAC T C TAAG AG C C GAG C A AG 

SerGlnAspValLysAsnTrpIIetThrGluThrLeuLeuVaLGlnAsnAlaAsnProAs? 
CTTCACAGGATGTAAAAAATTCGA7GACAGAAACCTTG TTGGTCCAAAATGCAAACCCAG 

1300 

CvsLvsThrlleLeuLysAlaLeuGLyProGlnAlaThrLauGluGluIIeClIecThrALa 
AT I G C A A G A C T A T C XT AAA AG CATTGGGACCACA.GGCTACACTAGAAGAAATGATGACAG 

CysGLnGlyValGlyGLy?roSer!IisLysAUAr;VsileuAlaGluAUi:ecSerGin 
C ATG T C AG G G AG TG G G G GG GC C C AG CCATAAAG CAAGAGTT C TG G C TG AG G C AATG AG C C 
1400 . . . ... 

AUThrAsnSerValThrThrAla:!etHetGlKAr;GlyAsn?heLysGly?roArgLys 
AAGCAACAAATTCAGTTACTACAGCAATGATG C AG AG AG G C AATT T T A AG GGCCCAAGAA 

1500 

IlelleLysCysPheAsnCysGlyLysGluGlyHisIleAULysAsaCysArgAlaPro 
AAAT TAT TAAG TG TTTCAATTG TG G C A AAG A AG G G C AC AT AG C AA AAAATTG CAGGGCCC 

ArgLysLysGlyCysTrpArgCysGlyLysGluGlyHtsGinLeaLysAspCysThrGlu 
C TAG G AAAAAGG G CTGTTGGAGATGTGGAAAG G AAG GAC AC C AAC TA AAAG ATTG C AC TG 

. ^POL • • 1600 

JPhePheArgGluAsnLeuAlaPheProGlttGlyLysAlaGlyGluLeu 
ArgGlnAlaAsnPheLeuG LyArg I leTr pPr o S e rH i s Ly sG lyAr gP r oG 1 yAs nPh e 
AG AG A C AGG CTAATTTTT TAG GGAG AAT TTGG C CTTCCCACAAGG GAAG G C C GGGG AAC T 

SerProLysGlnThrArgAUAsnSerProThrSerArgGluLeuArgValTrpGlyAr; 
LeuGlnSerArgProGluProThrAlaProProAUGluSerPheGlyPbeGlyGluGlu 
TTCTCCAAAGCAGACCAGAGCCAACAGCCCCACCAGCAG AGAGCTTCGGGTTTGGGGAAG 
1700 .... 
AspAsaProLeuSerLysThrGlyALaG luArgG UG lyThrValSerPheAsnPhePro 
IleThrProSerGlnLysGlnGluGlaLysAspLysGluLeuTyrProLeuThrSerLeu 
AGATAACCC C CT C TC AAAAAC AGG AG C AG AAAG ACAAG GAAC TG T AT CCTTTAACTTC C C 

•GAG- • - 1800 

GlnlleThrLeuTrpGlnArgProLeuValAlalleLysIleGlyGlyGlnLeuLysGlu 
LysSerLeuPheG lyAsnAs pProLeuSerGl n\ 
TCAAATCACTCTTTGGCAACGACCCCTTGTCGCAAfrAAAAATAGGGGGACAGCTAAAGCA 

AlaLeuLeuAspThrG 1 yAl a As pAs pthr Va 1 LeuG luG lulle cAs nLeuPr oG lyLys 
AG CTC TATTAG ATACAG G AGC AGATGATAC AGTATTAG AAGAAATG AATTTG C C AG GAAA 

1900 

TrpLy sProLyslIet I leGlyGlylleGlyGlyPhelleLysVaUrgG InTyrAspGIa 
ATGGAAACCAAAAATG ATAGGGGGAATTGGAGGTTTTATCAAAGTAAG ACAGTATGATCA 

IleProIleGluIleCysGlyGlnLysAlalleGlyThrValLeuValGlyProThrPro 
AATACCCATAGAAATCTGTG GAC AG AAAG CTATAGGTAC AGTATTAG TAG GACCTACGCC 
2000 .... 
•ValAsnllelleGlyArgAstiLeuLeuThrG lnlleG 1 y Cy s Th r L euAs nP h e? ro I 1 e 
TGTCAACATAATCGGAAGAAATTTGTTG ACCCAGATTGGCTGCACTTTAAATTTTCCAAT 

2100 

SerProIleGluThrValProValLysLeuLysProGlyMetAspGlyProLys ValLy s 
TAGTCCTATTGAAACTGTACC AGTAAAATTAAAGCCAGGAATGGATGGCC CAAAAGTTAA 
• ••••• 

GlnTrpProLeuThrG luGluLysIleLysAULeuThrGluIleCysThrAspHetGlu 
AC AATGGC C ATTG AC AG AAG AAAAAAT AAAAG C ATTAAC AG AAATTTGTAC AG AT ATG GA 

2200 



FIG. 7C 



LysGluGlyLysIleSerArglLeGlyProGluAs nProTyrAsnThrFr oIlePheAla 
AAAGGAAGG AAAAATTTC AAGAATTGGGCCTGAAAATCCATACAATACTCCAATATTTG C 

IleLysLysLysAspSerThrLysTrpArgiysLeuValAspPheArgGluLeuAsr.Lys 
C ATAAAGAAAAAAG AC AGTACC AAGTGG AG AAA.ATTAG TAG ATTTC AG AG AAC 7TAA TAA 
2300 .... 
ArgThrGlnAspPheTrpGLuValGlnLeuGLylleProRLsProAlaGIyLeuLysLys 
GAGAAC7CAAGATTTCTCGGAAGTTCAATTAGGAATACCGCATCC7G CAGGGCTC AAAAA 

2400 

LysLysSerValThrValLeuAspValGlyAspAlaTyrPheSerValProLeuAspGlu 
GAAAAAATCAGTAACAGTACTGGATG7GGGTGATGCATATTTTTCAGTTCCCT7AGATGA 

Asp?heArgLys7yr7hrALaPheThrIleSerSerIleAsaAsnGlu7hrProGlyIle 
AG ATTT7AGG AAATATAC C GCCTTTACCATATC TAG TATAAACAAT G AG AC AC C AG G G AT 

2300 

ArgTyrGlnTyrAsnValLeuProGlnGlyTrpLysGlySerProAiallePheGlnSer 
T AG AT ATC AGTAC AATG 7G C 77 C C AC AG GG A7G GAAAG GA7 C AC C G G CAA7A7TC C AA AG 

SerlletThrLysIleLeuGluProPheArgLysGinAsnProGluMetVaLIleTyrGln 
7AGCA7G AC AAAAATC77AGAGCC C777AGAAAACAAAA7CCAGAAATGG77A7C7A7CA 
2600 .... 
TyriUcAspAspLeuTyrValGlySerAspLeuGluIleGlyGlnHisArgThrLysIle 
A7ACA7GGATGATT7G7ATG7AGGA7C7G AC T7 AG AAAT AG GGC AG C AT AG G AC A AA AAT 

2700 

GluLysLeuArgGluKisLeuLeuArgTrpG lyPheThrArg Fr oAspLys LysHisGln 
AGAGAAATTAAG AGAACATCTATTG AGGTGGGGATT7ACC AGACC AGATAAAAAAC ATCA 

LysGluProProPheLeuIrpMetG lyTyrGluLeuHisProAspLysTrpThrVa. LG La 
GAAAG AAC C C C C AT 77C TT7G G A7 G G G 77 ATG AAC 7C C AT C CTG AT AA ATG GACAG 7AC A 

2800 

SerlleLysLeuProGluLysGluSerTrpThrVa lAsnAspILeGlnAs nLeuValGlu 
G TC7A7AAAAC7GCC AGAAAAGG AG AG C7GGAC7GTCAATG ATATACAG AACTTAGTG GA 

ArgLeuAsnTrpAlaSerGlnlleTyrProGlylleLysVa lArgG InLeuCysLysLeu 
GAGATTAAAC TGGGCAAG CC AG AT TTAT C C AG GAATTAAAGTAAG ACAATTATGTAAACT 
2900 .... 
LeuArgGlyThrLysAlaLeuThrGluVa II leProLeuThrG luG luAl aG luLeuGlu 
CCTTAGGGGAACCAAAGC ACTAACAGAAGTAATACCACTAACAGAAGAAGCAGAATTAG A 

3000 

LeuAlaG luAsnArgG lul LeLeuLysGluProValHisGlyValTyrTyrAspProSer 
ACTGGCAGAAAACAGGGAAATTTTAAAAG AAC CAGTACATGGAG TGTATTATGACC C ATC 

LysAspLeulleAlaGluIleG InLysG InG lyHisG lyGlnTrpThrTyrG lnlleTyr 
AAAAGA.CTTAATAGC AGAAATAC AGAAAC AAGGGCACGGCC AATGGACAT ACC AAATTT A - 

3100 

GlnGluProPheLysAstiLeuLysThrGlyLysTyrAlaArgMetArgGlyAlaEisThr 
TCAAG AACCATTTAAAAATCTGAAAACAGGAAAGTATGCAAGAATGAGGGGTGCCCACAC 

AsnAspValLysGlnLeuAlaGluAlaValGlnArglleSerTbrGluSerlleVallle 
TAATGATGTAAAGCAATTAGCAG AGGCAGTGCAAAGAATATCCACAGAAAGCATAGTGAT 
3200 ■ . 

TrpG lyArgThrPr oLy sPheArgLeuPr oIleG InLy sG luThrlrpG luThrTrpTr p 
ATGGGGAAGG ACTC CTAAATTTAGACTACC CATAC AAAAGGAAACA7GGGAAACATGGTG 

3300 



FIG.7D 



AlaGiuTy r Tr pG lnA 1 aThr Tr p I lePr oG luTrpGluPhe?alAsnThrProProLeu 
GGCAGAGTATTGGCAAGCCACTTGGATTCCTGAGTGGGAATTTGTCAATACCCCTCCTTT 

ValLysLeuTrpTyrGlnLeuGluLysGluProIlelleGlyAlaGluThrPheTyrVal 
AGTAAAATTATGGTAC CAGTTAG AG AAGGAAC CCATAATAGGAG C AG AAAC TTTC TATG T 

3 40 0 

AspGlyAl aAl a A s nAr g G 1 u Th r L y s L e uG 1 y Ly s A 1 a G 1 y Ty r Va 1 Th r A s p A r 3 C I y 
AG ATG G GG C AG C TA AT AG AG AG AC TA A AT T AG G A AAAG C AG GATATGTTACTGACAGAGG 

ArgGlnLysValValProLsuThrAspThrThrAsnGlnLysThrGluLeuGlnAlalle 
AAG AC AG AAAG TTGTCC C XT TG AC TG AC ACG AC AA AT C AG AAG ACTG AG TTAC A AG C AAT 
3500 .... 
AsnLeuAlaLeuGlnAspSerGlyLeuGluValAsnlleValThrAspSerGlnTyrAIa 
TAATC TAG C C TTG C AG G ATTCGGG AT T AG A AG T A AAC AT AGTAAC AG AT T C A C A AT ATG C 

3600 

LeuGlyllelleGlnAlaGlnProAspLysSerGLuSerGluLeuValAsnGlnllelle 
ATTAGGAATCATTCAAGC AC AAC C AG ATA AG AG TG AAT C AG AG TT AG T C A ATC A A AT AAT 

GluGlnLeuIleLysLysGluLysValTyrLeuAlaTrpValProAlaKisLysGlyll- 
AG AGC AG TTAATAAAAAAG GAAAAG GTTTACC TG GC ATGGGTAC C AG C A C AC AAAG G AAT 

3700 

GlyGlyAsnGluGlnValAspLysLeuValSerGlaGlylleArsLysValLeuPheLeu 
TGGAGGAAATG AAC AAGTAGATAAATTAG TCAGTCAAGGAATCAG GAAAGTACTATTTTT 

AspGlylleAspLysAUGlaGluGluHisGluLysTyrHisAsnAsnTrpArgAlaMe: 
G GATG GAATAG ATAAGG CTCAAG AAG AAC ATG AGAAATATCAC AAC AATTG GAG AG C AAT 
3800 .... 
AlaSerAspPheAsnLeuProProValValAlaLysGluIleValAlaSerCysAspLys 
G GCT AG TG AT TTT AAC CTACCACCCGTG GTAG C AAAAG AAATAGTAG C TAG CTG TGATAA 

3 900 

CysGlnLeuLysGlyGluAlallecHisGlyGlnValAspCysSerProGlylleTrpGla 
ATGTC AG CTAAAAGGAG AAGCC ATGC ATGGACAAGT AGACTGTAGTC CAGGAATATG GCA 

LeuAspCysThrHisLeuGluGLyLysVallleLeuValAlaValHisValAlaSerGly 
ATTAGATTGTACACACTTAGAAGGAAAAGTTATCCTGGTAGCAGTTCATGTAGCCAGTGG 

4000 

TyrlleGluAlaGluVallleProAlaGluThrGlyGlaG luThr A I aTy r PheLeuLeu 
CTA TATAG AAG CAG AAGTTATTCCAG CAG AAACAGGGCAGGAAAC AGC ATATTTTCTTTT 

Ly sLeuAlaG lyArgTr pPro Va 1 Ly s Va IValHis Thr As pAs nG lySerAsnPheThr 
AAAATTAGCAGGAAGATGGCCAGTAAAAGTAGTACATACAGACAATGGCAG CAATTTCAC 
4100 .... 
SerAlaAlaVaXLysAlaAlaCysTrpTrpAlaGly IleLysG InGLuPheGlyllePro 
CAGTGCTGCAGTTAAGGCCGCCTGTTGGTGGGCAGGTATCAAACAGGAATTTGGAATTCC 

4200 

Tyr AsnPr oGlnSerGlnGly Val ValGluSerHecAsnLy sGluLeuLy sLy s Ilelle 
CTAC AATCCCCAAAG TCAAGGAGTAGTAGAATCTATG AATAAAGAATTAAAGAAAATTAT 

GlyGlnValArgAspGlnAlaGluHisLeuLysThrAlaValGlnKetAlaValPhelle 
AGGACAGGTAAG AG AT CAAG CTG AACATCTTAAG AC AGCAGT ACAAATG G C AG TAT T CAT 

4300 

HisAsnPheLysArgArgArgGlylleGlyGlyTyr SerAl aG lyGluArgl lelleAsp 
CCACAATTTTAAAAGAAGAAGGGGGATTGGGGGATAC AGTGCAGGGGAAAGAATAATAGA 



FIG. 7E 



IlelleAlaThrAspIleGlnThrLysGluLeuGlnLysGlnllelleLysIleGlnAsn 

CATAATAGCAACAGACATACAAACTAAAGAATTACAAAAACAAATTATAAAAATTC/«\AA- 

/. a n n 



4400 .... 
PheArgValTy rTy r A r g A s p S e r A r g A s o? r o I 1 eTrpLysG lyProAlaLy sLeuLeu 
TTTTCGGGTTTATTACAGAG ACAGCAG AG ATC CAATTTGGAAAG G AC C AG C AAAG C T C C T 

4500 

TrpLysGlyGluGlyAlaValVallleGlnAspLysSerAspIleLysValValProArs 
CTGGAAAGGTG AAGGGGC AGTAG TAATAC AAG AC AAGAGTG AC ATAAAGGTAGTAC CAAG ° 

ArsLysValLysIlelleArgAs p fyrG lyLysG lalletAlaG LyAspAs pCysVa 1A1 a 

/He CG luAs nAr gTrpG lnVallSetlleValTrpCU 
AAG AAA AG T AA AG AT TA TTAG G G AT TATG G AAAAC A G ATG G CAGGTGATGATTGTGTGGC 
POL<£-». . 4600 . . 

SerArgGlnAspGluAspl 
VaLAspArgllecArglljeLysThrTrpLysSerLeuValLysHisHi^MetTyrValSer 
AAG TAG AC AG GATG AG G ATCTAAAAC ATG GAAAAG TTTAG TAAAAC ACC AT ATG T ATG T T T 

L y s L y s A 1 a A s n A r g T r p ? h e T y r A r 3 H i s H i s T y r G 1 u S e r ? r o E L s P r o L y s 1 1 e S e r 
CAAAGAAAGCTAAC AGATGGTTT7ATAG ACATCx\CTATGAAAGCCCCCACCCAAAAATAA 
4700 .... 
SerGluValHisIleProLeuGlyG luAl aAr gLeu Va UleLysThrTyrTrpGlyLeu 
GTTC AG AAG TAC AC AT C C C AC TAG GAG AAG C TAG AC TGGTAAT AAAAACAT AT TG G GGT C 

4S00 

HisThrGlyGluArgGluTrpKisLeuGlyGlnGlyValSerlleGluTrpArgLysArg 
TG CAT AC AG GAG AAAG AG AATGG CATC TG G G T C AG GG AGT CT C C AT AG AAT G G AG GAAAA 

ArgTyrSerThrGlnValAspProGLyLeuAlaAspGlaLeuIleHisI-ietTyrTyrPhe 
GG AG ATA TAG C AC AC AAG TAGACCCTGGCC TG G C AG AC C AAC TA ATTC AT ATG T A T T A T T 

4900 

AspCysPheSerGluSerAlalleArgLysAlalleLeuGlyAspIleValSerProArs 
TTGATTGTTTTTCAGAATCTGCTATAAGAAAAGCC ATATTAGGAGATATAGTTAGTCCTA 

CysCluTyrGlnAlaGlyKisAsnLysValGlySerLeuGlnTyrLeuAlaLeuThrALa 
CCTGTGAGTATCAAGCAGGACATAACAAGGTAGGATCCCTACAGTATTTGGCACTAACAG 
5000 .... 

LeuIleAlaProLysGlnlleLysProProLeuProSerValArgLysLeuThrGluAsp 
CATTAATAG CAC CAAAACAG ATA AAG CC AC CTTTG CC TAGTGTTAG GAAG CTAACAG AAG 
r—^> R • . - . . 5100 

Vie tGluG InAlaProAlaAspGlnG lyPr oG lnArgG luProTyrAsnGluTrpAla 
ArkTrpAsnLy sProG InG ltiThr Ar gG 1 y H i s ArgG L y S e r H i s Tbr Me t As nG 1 y H i s 
ATAGp.TGGAACAAGCCCCAG CAGAC CAG GGGCCAC AGAGGGAGCC ATACAATGAATGGGC 
Q <t— . ...... 

LjeuGlaLeuLeuG luGluLeuLysSerG luA laValArgHisPheProArglleTroLeu 
ATJTAGA.GCTTTTAG AGGAGCTTAAGAGTG AAGCTGTTAGACATTTTCCTAGGATATGGCT 

5200 

HisSerLeuGlyG InHisIleTyrG luThr Ty rG lyAs pThrTr p Va 1G 1 y Va 1 G 1 u A 1 a 
C CATAG CTTAG G AC AAC ATATTTATG AAACTTATG G GGATAC CTGGGTAGGAG TTGAAG C 

IlelleArglleLeuGlnGlnLeuLeuPhelleEisPheArglleGlyCysGlnEis Ser 
TATAATAAGAATACTGCAACAATTACTGTTTATTCATTTCAGAATTGGGTGTCAACATAG 
5300 . p?>S . R *<H 

ArglleGlyll e I 1 e Ar gG InAr g Ar g A 1 a Ar g^s nG 1 y S e r S e r Ar g S e r 

KetAspProValAspProAsnLeuGlu 



CAGAATAGGCATTATTCGACAGAGAAGAGCAAGA/ ATGGATCCAGTAGATCC 



TAAC C TAG 
5400 



FIG. 7F 



ProTrpAsnHisProGlySerGlnProArgThrProCy sAsnLysCy sHisCy sLysL 
AGCCCTGGAACCATCCAGGAAGTCAGCCTAGGACTCCTTGTAACAAGTGTCATTGTAAAA 

CysCysTyrHisCysPro ValCysPheLeuAsnLy sG lyLeuG Ly II eSerTyrG LyA 

AGTGTTGCTATCATTGCCCAGTTTGCTTCTTAAACAAAGGCTTAGGCATCTCCTATGGCA 

5500 

LysLysArgArgG lnArg ArgG 1 y ? r o ? r o G 1 aG LyClyGlaAlaEisG InVa IPro I 
GGAAGAAGCGGACACACCGACGACGACCTCCTCAAGCCGCTCAGGCTCATCAAGTTCCTA 
ProLy sG lnl 

TA C C AA AG CAGfTAAGTAGTACATGTAATG C AAC C TTTAG GGATAATAGCAATAGCAGCAT 

5600 . . - - 

TAG TAG TAG C AA TAATAC TAGCAATAGTTGTGT G G AC C ATAG T AT TC AT AG A AT AT AG A A 

5 7 00 

GG AT AAAAAAG C AAAGGAG AAT AG A C TG TTTACT TG ATAG AATAAC AG AAAG AG C AG A AG 
P*ENV • 

IMetArgAlaArgGlylleGluArgAsnCysGlnAsnTrpTrpLysTrpGly 
AC AG TG G C AATG AG AG C G AG GGGGAT AG AG AG AAAT TG T C AAAAC TG GTGGAAATGGGGC 

5S00 

IleHetLeuLeuGlylleLeuIIetThrCysSerAlaAlaAspAsnLeuTrpValThrVal 
ATCATGCTCCTTGGGATATTGATGACCTGTAGTG C TG C AG AC AATC TG TG GGTC AC AG TT 

TyrTyrGly Va 1 P r o Va 1 T r p L y sG 1 u A 1 a Thr Th r Th r L eu P h e C y s A 1 a S e r A s p A 1 a 
TATTATGGGGTGCCTGTATGGAAGGAAGCAACCACCACTCTATTTTGTG CATCAGATGCT 
5900 .... 
LysSerTyrGluThrGluAlaHisAsnlleTrpAlaThrHisAlaCysValProThrAsp 
AAATCATATGAAACAGAGGCACATAATATCTGGGCCACACATGCCTGTGTACCCACGGAC 

6000 

ProAsnProGlnGLuIleAlaLeuGluAsnVaLThrGluAsnPheAsiiMetTrpLysAsn 
C C C AA C CCA CAAGAAATAG CAC TG GAA AATG TGA C AG AAAAC TTTAACATG TGGAAAAAT 
*••••. 
Asnllet Va 1G luG InMe tHisG luAspIlelleSerLeuTrpAspGlnSerLeuLysPro 
AACATGGTGGAACAGATGCATGAGGATATAATCAGTTTATGGGATCAAAG CCTAAAACC A 

6100 

Cy s ValLysLeuThrProLeuCy s Va IThrLeuAsnCy sSerAspG luLeuArgAsnAsn 
TGTG TAAAATTAACC CCACTC TG TG TCAC TTTAAACTG TAG TGATGAATTG AGGAACA AT 
• • • . . . 

GlyThrMecGlyAsnAsnValThrThrGluG luLysG lylie tLy sAsnCy sSerPheAsn 
G GCACTATGGG GAAC AATGTCAC TAC AG AG GAG AAAGG AATG AAAAACTG CTCTTTCAAT 
6200 .... 
ValThrThrValLeuLysAspLy sLysG InG 1 nVa lTy r Al aLeuPheTy rArgLeuAsp 
GTAACCACAGTACTAAAAGATAAGAAGCAGCAAGTATATGCACTTTTTTATAGACTTGAT 

6300 

lie ValFroI leAspAsnAspSer S erThr As n S e r Thr As nTy r Ar g Le u I 1 e As nC y s 
ATAGTACCAATAGACAATGAT AGTAGTAC CAATAGTACCAATTATAGGTTAATAAATTGT 

AsnThr SerAlalleThrGlnAUCysProLys ValSerPheGluProIleProI leKis 
AATACCTCAGCCATTACACAGGCTTGTCC AAAGGTATCCTTTG AGCCAATTCCCATACAT 

6400 

TyrCy s Al aP r o Al aG ly Ph e A 1 a 1 1 e LeuLy s Cy sAr gAs pLy sLy sPheAsnG lyThr 
TATTGTGCCCCAGCTGGTTTTGCGATTCTAAAGTGTAGAGATAAGAAGTTCAATGGAACA 

GlyProCy s Th r As n Va 1 S e r Th r Va 1G InCy sXhrHisG 1 y I 1 eA r gP r o Va 1 ValSer 
GGCCCATGCACAAATGTCAGCACAGTACAATGTACACATGGAATTAGGCCAGTGGTGTCA 
6500 .... 
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ThrGlnLeuLeuLeuAsnG LySerLeuAlaG luG fu Glu Valll ell eArgSerGluAsn 
\CTC AACTGCTGTTGAATGGCAGTCTAGC AGAAGAAGAGGTCATAATTAGATCCGAAAAT 

6600 

LeuThrAsnAsnAlaLy sAsnllell eAUHisLeuAsnG luSerValLysIleThrCys 
CTC ACAAAC AATG CTAAAAACATAATAGC AC ATCTTAATG AATCTGTAAAAATTACCTGT 

AlaAr-ProTyrGlnAsnThrArsGlnArgTtirProIleGlyLeuGlyClnSerLeuTyr 
GC A -\G G C C CTAT C AAA A TAC AAG ACAAA3AACAC C TATAG GAC TAG GG CAAT C A C T C X AT 

6700 

T h r T h r A r ~ S e r A r g S e r 1 1 e 1 1 eG 1 y G 1 nA 1 aH i s C y s A s n 1 1 e S e r Ar s A 1 a G 1 n 7 r p 
AC TAC AAG AT C AAG AT C AATAATAG G AC A AG C AC AT TG TAATATTAG TAG AG C A C AATG G 

SerLysThrLeuGlnCUValAlaArgLysLeuGlyThrLeuLeuAsnLysThrllelle 
AG TAAAAC TT T AC AAC A AG T AG C T AG AAAATTAGG AAC C CTTC TTAACAAAACAAT AATA 

6000 . ' • 

LysPheLysProSerSerGlyGlyAspProGluIleThrThrEisSerPheAsnCysGiy 
AAG TTTAAAC C AT C C TCAGG AG GGGAC C C AG AAATTAC AACAC AC AGTTTTAATTG TG GA 

6900 

GlyGluPhePheTyrCysAsnThrSerGlyLeuPheAsnSerThrTrpAsnlleSerAla 
G GGG AATTC TTC TAC TG TAAT ACATCAGG AC TG TTTAATAG TAC ATG GAAT ATTAG TG C A 

TrpAsnAsnileThrGluSerAsnAsnSerThrAsnThrAsalleXhrLeuGlaCysArg 
TGG AATAATAT TACAG AG TC AAATAAT AG C AC AAAC AC AAACATCACAC TCCAATG C AG A 

7000 - - 

IleLy sGlnllelleLysIletValAlaGlyArgLysAlall eTy r AlaPr oPr o I 1 eG lu 
AX A A AAC A A AT TAT AAAG ATGGTG G CAG GC AG GAAAG C AAT AT ATG CCCCTCC T ATC G AA 

Ar-AsnlleLeuCy s S e r S e r A s n I 1 e Thr G 1 y L e uL euLeuT hr ArgA s pG 1 y G ly I le 
AG AAAC AT TCTATG TTC ATC AAATATTAC AG GGCTACTATTGACAAGAG ATGGTG G TATA 
7100 .... 
AsnAsnSerThrAsnG luThr?heArg?roGlyGlyGlyAspHe tArgAs pAsnTrpArg 
AATAATAG TACTAACGAGACCTTTAGACCTGGAG G AG GAG AT ATG AG GGA CAAT TGG AG A 

7200 

SerGluLeuTyrLy sTyrLy s Val Val G lull eGluProLe uGly ValAlaP roThr Arg 
AGTGAATTATATAAATATAAGGTAGTACAAATTG AACCACTAGGAGTAGCACCCACCAG G 

AlaLysArgArgValValGluArgGluLysArgAlalleGlyLeuGlyAlaKetPheLeu 
GC AAAG AG AAG AGTGGTG GAAAG AGAAAAAAG AGC AATAGGATTAGGAGCTATG TTC C TT 

7300 

GlyPfceLeuGlyAlaAlaGlySerThrlle tG 1 yA 1 a Ar g S e r 7a 1 Thr LeuThr V a 1 G In 
GGGTTCTTGGGAGCAGCAGGAAGCACGATGGGCGCACGGTCAGTGACGCTGACGGTACAG 
• . • • • * 

AlaAr»GlnLeuMet SerGly I leValGlnG InG InAs nAsnLeuLeuArgAlalleGlu 
G CC AGAC AATTAATG TCTGGTATAG TG C AACAG C AAAACAATTTG C TGAGGGCTATAG AG 
7400 . 

AlaG InGlnEisLeuLeuG InLeuThrVa lTrpG ly IleLysGlnLeuGlnAlaArgI le 
GCGCAAC AGCATCTGTTGCAACTCACGGTCTGGGGCATTAAACAGCTCCAGGCAAGAATC 

7500 

LeuAlaValG luAr sTy r L euL y s As pG InG 1 nL euL euG ly 1 1 s Tr pG 1 y Cy s S e r G iy 
CTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTAGGAATTTGGGGTTGCTCTGGA 



FIG. 7ft 



LysHisIIeCy sf hr ThrAs n Va 1 P r oTr p As nS e r SerTrpSerAsaifrgSerLeuAsn 
AAACACATTTGCACCACTAATGTGCCCTGGAACTCTAGTTGGAGTAATAG ATC TCTAAAT 

7600 

GluIleTrpGlnAsnMetThrTrpKetGluTrpGluArgGluIleAspAsnTyrThrGly 
GAG ATTTG G CAGAACATGACCTG GATGGAGTG GG AAAG AG AAATTGACAATTACAC AG GC 

LeuIIeTyrSerLeuIleGluGluSerGlnThrGlnGlnGLuLysAsnGluLysGluLeu 
TTAATATATAGCTTAATTG AGGAATCGC AGACCCAGC AAG AAAAG AATG AAAAAG AATTG 
7700 .... 
LeuGluLeuAspLysTrpAlaSerLeuTrpAsaTrpPheSerlleThrGlnTrpLeuTrp 
TTGG AATTG GAC AAG TGGG C AAGTTTG TGGAATTG G TTT AG C AIAAC AC AATGG C TG TG G 

7 800 

TyrrieLysIlePhellelletllelLeGlyGlyLeuIleGlyLeuArglleValPheAla 
TATATAAAAATATTCATAATGATAATAG GAG GCTTG ATAGGTTTAAG AATAG TTTTTG CT 

ValLeuSerLeuVa iAsnArgVa lArgG InG lyTyr SerProLeuSerPheG InThrLeu 
G TG C TTT C TTT AG T A AATAG AG TTAGGC AGGGAT ACTCAC CTCTG TCG TTTC AG ACC CTC 

7900 

LeuProAlaProArgGlyProAspArgProGluGlyThrGluGluGluGlyGlyGluArg 
CTCC C AGCCCCGAGGGGACCCG AC AG GCCCGAAGGAACAGAAGAAGAAGGTG GAG AG CG A 

GlyArgAspArgSerValArgLeuLeuAsnGlyPheSerAlaLeuIleTrpAspAspLeu 
GGCAGAGACAGATCCGTGAGATTGCTG AACGGATTCTCGGCACTTATCTGGGACGACCTG 
8000 .... 
ArgSerLeuCysLeuPheSerTyrHisArgLeuArgAspLeuIleLeuIleAlaValArg 
CGGAGCCTGTGCCTCTTCAGCTACCACCGCTTGAGAGACTTAATCTTAATTGCAGTGAGG 

8100 

IleValGluLeuLeuG lyArgAr gG lyTrpAspIleLeuLysTyrLeuTrpAsnLeuLeu 
ATTG TAG AACTTCTGGG A CG CAG G GG GTGGG AC ATC CTC AAAT ATC TG TGGAATCTCCTA 

GlnTyrTrpSerG InG luLeuArgAsnSerAlaSerSerLeuPheAspAlalleAlalle 
CAGTATTGGAGTCAGGAACTGAGGAACAGTGCTAGTAGCTTGTTTGATGCCATAGCAATA 

8200 

Ala 7a 1 Al aG luGlyThrAspArgVal IleGluIlelleGlnArgAlaCy sArgAlaVa 1 
GCAGTAG CTGAGGGGACAGAT AGAGTTATAGAAATAATACAAAG AGCTTGCAGAGCTG TT 

W . 

LeuAsnll eProArgArgl UArgGlnGlyLeuGluArgSerLeuLeuj r~* F 

(KetGlyGly 

CTTAAC ATAC CCAGAAGAATAAG ACAGGGCTTAG AAAGGTCTTTACT TTAAAATGGGTGG- 
8300 .... 
LysTrpSerLy sSer Ser I leValGlyTrpProAlalleArsGluArglleArgArgThr 
CAAATGGTCAAAAAGTAGTATAGTGGGATGGCCTGCTATAAGGGAAAGAATAAGAAGAAC 

8400 

AsnProAlaAUAspGlyValGlyAlaValSerAtsAspLeuGluLysEisGlyAlalle 
TAATC CAGCAGCAG ATGGGGTAGGAG CAGTATCTCGAG ACCTGGAAAAACATG GGGCAAT 

Thr Ser Ser AsnThr Al aSer ThrAs nAlaAspCy sAlaTrpLeuG luAlaG InG luGlu 
CACAAGTAGCAATACAGCAAGTACTAATGCTG ACTGTGCCTGGCTAGAAGCAC AAGAAG A 

8500 

SerAspGluValG ly PheP r o Va lAr gP r oG InVa IP r oLeuAr gP r olle t ThrTy rLy s 
G AGCGACGAG GTGGGCTTTCCAGTCAGACCCCAGG7ACCTTTAAGACCAATGACTTACAA 



U3 

G luAlaLeuAspLeuSerHisPheLeuLy sG luLysGlyGljyLeuGluGlyLeuIleTrp 
AGAAGCTCTAGATCTCAGCCACTTTTTAAAAGAAAAGGGGGGkCTGGAAGGGCTAATTTG 
8600 . 1 
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SerLysLysArgGlnGluIleLeuAspLeuTrpValTyrAsnThrGlnGlyllePhePro 
G TC CAAAAAG AG AC AAGAG ATC CTTG AT CTTTGGGTCTAC AAC ACAC A AG GCATCTTCCC 

37 00 

AspTrpGlnAsnTyrThrProGlyProGlylleArgTyrProLeuThrPheGlyTrpCys 
TGAT TG G C AAAAC TAC AC AC C AG GG C CAG GG AT C AG AT ATC C ACTA AC CTTTGGATGG TG 

TyrGluLeuValProValAspProGlnGLuValGluGluAspThrGluGi'yGluThrAsn. 
CTACGAG CTAGTACCAGTTGATCCACAGGAGGTAGAAGAAGACACTGAAGGAGAGACCAA 

8800 

SerLeuLeuEisProIleCysGlnEisGlyMetG luAs pProGLuArgGlnValLeuLys 
CAGCTTGTTACACCCTATATGCCAGCATGGAATGGAGGACCCGGAGAGACAAGTGTTAAA 

TrpArgPheAsnSerArgLeuAlaPheG luRisLysAl aArgG LuHe tHisProG luPhe 
ATG G AG ATTTAAC AG C AG AC TAG C ATTTG AG C AC A AGGCC CGAG AG ATGCATC CGG AG TT 
S900 .... 

Ty rLysAsn 

CTAC AA AAACTG ATG AC AC CGAG CTTTCTACAAGGGACTTTCCGCTGGGGACTTTCCAGG 

9000 

GAGG CG TG GAC TGGG CG G GAC TGGGGAG TG G CTAAC CCT CAGATGCTGCATATAAG C AG C 
U3 & p*R 

TG CTTTT TG C C TGTAC TGJGGTCT CTC TG G TTAG AC CAG ATTTG AGCCTGGGAGCTCTCTC 

9100 . B&-i 

GCTAGCTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTTGAGTGCTTCAA | 



